Accuracy of Answers to Cell Lineage Questions Depends on Single-Cell Genomics Data Quality and Quantity
نویسندگان
چکیده
Advances in single-cell (SC) genomics enable commensurate improvements in methods for uncovering lineage relations among individual cells, as determined by phylogenetic analysis of the somatic mutations harbored by each cell. Theoretically, complete and accurate knowledge of the genome of each cell of an individual can produce an extremely accurate cell lineage tree of that individual. However, the reality of SC genomics is that such complete and accurate knowledge would be wanting, in quality and in quantity, for the foreseeable future. In this paper we offer a framework for systematically exploring the feasibility of answering cell lineage questions based on SC somatic mutational analysis, as a function of SC genomics data quality and quantity. We take into consideration the current limitations of SC genomics in terms of mutation data quality, most notably amplification bias and allele dropouts (ADO), as well as cost, which puts practical limits on mutation data quantity obtained from each cell as well as on cell sample density. We do so by generating in silico cell lineage trees using a dedicated formal language, eSTG, and show how the ability to answer correctly a cell lineage question depends on the quality and quantity of the SC mutation data. The presented framework can serve as a baseline for the potential of current SC genomics to unravel cell lineage dynamics, as well as the potential contributions of future advancement, both biochemical and computational, for the task.
منابع مشابه
A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data
Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...
متن کاملA Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data
Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...
متن کاملComparative genomics of human stem cell factor (SCF)
Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...
متن کاملP-130: Piwil2 Reprograms Human Fibroblasts to Germ Cell Lineage
Background The piwi family genes are highly conserved during evolution and play a crucial role in stem cell self-renewal, gametogenesis, and RNA interference in diverse organisms ranging from Arabidopsis to humans. Piwil2, also known as Hili, is one of the four human homologues of piwi. Piwil2 was found in germ cells of adult testis, suggesting that this gene functions in spermatogonial stem ce...
متن کاملP-64: Germinal Cells Intracytoplasmic Carbohydrate,Lipid and Lipase levels Alter in Longtime Varicocele-Induced Rats
Background: Despite several studies many questions remain regarding how varicocele develops its adverse effect on testicular tissue. Spermatogenesis cell lineage needs carbohydrates as a main source of energy for cell division. Any disruption in glucose transporting system in seminiferous tubules results in remarkable decrease in germinal cells biological function. Therefore present study was c...
متن کامل